A robust glottal source model estimation technique
نویسندگان
چکیده
This paper describes a robust glottal source estimation method based on a joint source-filter separation technique. In this method, the glottal flow derivative is modelled as the Liljencrants-Fant (LF) model and the vocal tract is described as a time-varying ARX model. Since the joint estimation problem is a multi-parameter nonlinear optimization procedure, we separate the optimization procedure into two passes. The first pass initializes the glottal source and vocal tract models providing robust initial parameters to the following joint optimization procedure. The joint estimation determines the accuracy of model estimation, which is implemented with a trust-region descent optimization algorithm. Experiments with synthetic and real voices show the proposed method is a robust glottal source parameter estimation method with a considerable degree of accuracy.
منابع مشابه
Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter
We propose a glottal source estimation method robust to shimmer and jitter in the glottal flow. The proposed estimation method is based on a joint source-filter optimization technique. The glottal source is modeled by the Liljencrants–Fant (LF) model and the vocaltract filter is modeled by an auto-regressive filter, which is common in the source-filter approach to speech production. The optimiz...
متن کاملRobust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech
This paper presents a robust feature extraction method effective to speech signal with high fundamental frequency and/or corrupted by additive white noise. The method represents the glottal source wave using HMM in order to model the nonstationary properties. The nodes of HMM are concatenated in a ring state to represent the periodicity of voiced sounds. The method can accurately extract glotta...
متن کاملEstimating the voice source in noise
Estimation of the glottal source has applications in many areas of speech processing. Therefore, a noise-robust automatic source estimation algorithm is proposed in this paper. The source signal is estimated using a codebook search approach. The glottal area waveforms extracted from high-speed recordings of the glottis is converted to the glottal flow signals in order to evaluate the performanc...
متن کاملGlottal spectrum based inverse filtering
In this paper a new inverse filtering technique for the timedomain estimation of the glottal excitation is presented. This approach uses the DAP modeling for the vocal tract characterization, and a spectral model for the derivative of the glottal flow. This spectral model is based on the spectrum of the KLGLOTT88 model for the glottal source. The proposed procedure removes the glottal source fr...
متن کاملImproved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction
Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the s...
متن کامل